IN THE CLAIMS:

The text of all pending claims (including withdrawn claims) is set forth below. Cancelled 
and not entered claims are indicated with claim number and status only. The claims as listed 
below show added text with underlining and deleted text with strikethrough. The status of each 
claim is indicated with one of (original), (currently amended), (cancelled), (withdrawn), (new), 
(previously presented), or (not entered). 

Please AMEND claims 1, 7, 8 and 16-21 in accordance with the following: 

1. (currently amended) A method for collecting documents linked to each other from a 
network by crawling the network, comprising: 

collecting first documents equal to or larger, in number, than a predetermined value from 
inside a community through the network based on a reference relation of the first documents, the 
reference relation defining a relationship between the first documents and second documents 
inside or outside the community linked to the first documents; and 

collecting third documents from inside and outside the community based on the 
reference relation of [[collected]] the first documents after collecting the first documents 
[[equal to or larger in number than the predetermined value]] from inside the community. 

2. (currently amended) The method according to claim 1, further comprising: 

computing a significance level indicating a level of significance of [[the]] a collected 
document according to the reference relation of the collected document, and information about a 
position of the collected document in the network; and 

determining a new document to be collected based on the reference relation and the 
significance level. 

3. (currently amended) The method according to claim 2, wherein said collecting of the 
third documents [[to be collected]] is determined separately for inside the community and for 
outside the community. 

4. (currently amended) The method according to claim 3, further comprising: 

presenting a result of retrieving the [[collected]] third documents separately for inside the 
community and outside the community. 

5. (currently amended) The method according to claim 2, further comprising: 

determining whether [[either]] any of the documents is in the community according to 
information indicating the position of the document in the network. 

6. (cancelled) 

7. (currently amended) A method for collecting documents linked to each other from a 
network by crawling the network, comprising: 

providing a positive sample document group [[which is a document group]] relating to a 
field, and a negative sample document group [[which is a document group]] less related to the field; 

determining a prospective document [[which is to be collected and]] for collection that is 
related to the field based on a reference relation to the positive sample document group and the 
negative sample document group by computing a reference score indicating a level at which [[a]] 
the prospective document is referenced only by [[a]] at least one document in the positive sample 
document group based on the reference relation which defines a relationship between original 
documents and linked documents belonging to the positive sample document group or to the 
negative sample document group which are linked to the original documents; and 

collecting [[a]] the prospective document having a high reference score [[as the document to 
be collected]]. 

8. (currently amended) A method for collecting documents linked to each other from a 
network by crawling the network, comprising: 

providing a positive sample document group [[which is a document group]] relating to a 
field, and a negative sample document group [[which is a document group]] less related to the field; 

determining a prospective document [[which is to be collected and]] for collection that is 
related to the field based on a reference relation to the positive sample document group and the 
negative sample document group by computing a co-reference score indicating a level at which 
[[a]] the prospective document is referenced together with a first collected document in the positive 
sample document group for a second collected document referenced by a third collected 
document referring to a fourth collected document in the positive sample document group based 
on the reference relation which defines a relationship between original documents and linked 
documents belonging to the positive sample document group or to the negative sample 
document group which are linked to the original documents; and 

collecting [[a]] the prospective document having a high co-reference score [[as the document 
to be collected]]. 

9. (cancelled) 

10. (previously presented) The method according to claim 1, further comprising: 

summarizing the collected documents based on a referencing expression used in the 
collected documents. 

11. (previously presented) The method according to claim 1, further comprising: 

assigning a keyword to the collected documents based on a referencing expression used 
in the collected documents. 

12. (original) The method according to claim 1, further comprising: 

not assigning a keyword based on the referring expression when the referencing 
expression is used regardless of a content of a referenced document. 

13. (original) The method according to claim 11, further comprising: 

counting a number of different documents referenced using the referencing expression; 
and 

not assigning the keyword based on the referencing expression when the number of 
different documents is equal to or larger than a predetermined value. 

14. (original) The method according to claim 11, further comprising: 

counting a reference frequency at which each collected document is referenced by the 
referencing expression when the number of different documents is smaller than a predetermined 
value; and 

determining whether or not the referencing expression is assigned as the keyword based 
on the number of different documents and the reference frequency. 

15. (original) The method according to claim 11, further comprising: 

combining the keyword based on the referencing expression with a keyword extracted 
from text of the collected document, and a keyword extracted from information indicating a 
position in the network about the collected document. 

16. (currently amended) A method for retrieving documents linked to each other from a 
terminal belonging to a community in a network by crawling the network, comprising: 

transmitting information for retrieval of the documents to a server; and 
receiving the documents retrieved separately from inside and outside the community 
according to the information for retrieval together with information indicating a significance level 
for the community, the significance level indicating a relationship between original documents 
and other documents inside or outside the community which are linked to the original 
documents. 

17. (currently amended) A document collection apparatus collecting documents linked 
to each other from a network by crawling the network, comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference relation of a collected document, the reference relation defining a relationship 
between original documents and other documents inside or outside the community which are 
linked to the original documents; 

a community determination unit determining whether or not the prospect is in a 
community in the network according to information indicating a position in the network of the 
prospect; and 

a document collection unit collecting the prospect from the network, wherein said 
document collection unit collects the prospect from inside and outside the community after 
collecting documents larger in number than a predetermined value from inside the community. 

18. (currently amended) A document collection apparatus collecting documents linked 
to each other from a network by crawling the network, comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference relation between a positive sample document group which is a document group 
related to a field and a negative sample document group which is a document group less related 
to the field; and 

a document collection unit collecting the prospect from the network, wherein the 
reference relation is a relationship between original documents and other documents belonging 
to the positive sample document group or to the negative sample document group which are 
linked to the original documents. 

19. (currently amended) A computer-readable recording medium recording a program 
used to direct a computer to control collection of documents linked to each other from a network 
by crawling the network, comprising: 

collecting first documents equal to or larger, in number, than a predetermined value from 
a community through the network based on a reference relation of the first documents, the 
reference relation defining a relationship between the first documents and second documents 
inside or outside the community which are linked to the first documents; and 

collecting third documents from inside and outside the community based on the 
reference relation of [[collected]] the first documents after said collecting of the first documents 
equal to or larger, in number, than the predetermined value from inside the community. 

20. (currently amended) A computer-readable recording medium recording a program 
used to direct a computer to control collection of documents linked to each other from a network 
by crawling the network, comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 

determining a document to be collected relating to the field based on a reference relation 
to the positive sample document group and the negative sample document group, the reference 
relation defining a relationship between original documents and other documents inside or 
outside the community which are linked to the original documents; and 

collecting the document to be collected from the network. 

21. (currently amended) A computer data signal embodied on a carrier expressing a 
program used to direct a computer to control collection of documents linked to each other from a 
network by crawling the network, said program instructing the computer to perform the process 
comprising: 

collecting first documents equal to or larger than, in number, a predetermined value from 
inside a community in the network based on a reference relation of the first documents, the 
reference relation defining a relationship between the first documents and second documents 
inside or outside the community which are linked to the first documents; and 

collecting third documents from inside and outside the community based on the 
reference relation of [[collected]] the first documents after said collecting of the first documents 
equal to or larger, in number, than the predetermined value from the community. 